Abstract. Brlek and Reutenauer conjectured that any infinite word u with language closed under reversal satisfies the equality $2D(u) = \sum_{n=0}^{\infty} T_u(n)$, in which $D(u)$ denotes the defect of u and $T_u(n)$ denotes $C_u(n+1) - C_u(n) + 2 - P_u(n+1) - P_u(n)$, where $C_u$ and $P_u$ are the factor and palindromic complexity of u, respectively. This conjecture was verified for periodic words by Brlek and Reutenauer themselves. Using their results for periodic words, we have recently proved the conjecture for uniformly recurrent words. In the present article we prove the conjecture in its general version by a new method without exploiting the result for periodic words.

1. Introduction 

Brlek and Reutenauer conjectured in [6] a nice equality which combines the factor complexity $C_u$, the palindromic complexity $P_u$, and the palindromic defect $D(u)$ of an infinite word u. It reads as follows.

Brlek-Reutenauer Conjecture. If u is an infinite word with language closed under reversal, then

$$2D(u) = \sum_{n=0}^{\infty} T_u(n),$$

where $T_u(n) = C_u(n+1) - C_u(n) + 2 - P_u(n+1) - P_u(n)$.

Brlek and Reutenauer proved ibidem that their conjecture holds for periodic infinite words. It is known from [7] that the Brlek-Reutenauer conjecture holds for words with zero defect. In [3], we proved the conjecture for uniformly recurrent words. In our proof, we constructed for any uniformly recurrent word u whose language is closed under reversal a periodic word v with language closed under reversal such that $D(u) = D(v)$ and $T_u(n) = T_v(n)$ for any n. Then we used the validity of the conjecture for periodic words.

In this paper, we will prove that the Brlek-Reutenauer conjecture holds in full generality 
without exploiting the result for periodic words. Since both sides of the equality in the Brlek- 
Reutenauer conjecture are non-negative, validity of the conjecture will be shown if we prove the 
following two theorems. 

Theorem 1. If u is an infinite word with language closed under reversal such that both $D(u)$ and $\sum_{n=0}^{\infty} T_u(n)$ are finite, then

(1) $2D(u) = \sum_{n=0}^{\infty} T_u(n)$.

Theorem 2. If u is an infinite word with language closed under reversal, then

$D(u) < +\infty$ if and only if $\sum_{n=0}^{\infty} T_u(n) < +\infty$.

In the paper [3], which is devoted mainly to uniformly recurrent words, we already stated in the section Open problems one part of Theorem 2, namely that $D(u) < +\infty$ implies $\sum_{n=0}^{\infty} T_u(n) < +\infty$. As pointed out in 0], there is a gap in our proof, and its corrected version can be found in [2]. In order to make the present paper self-contained, so that the reader can understand and check all steps of the proof without having all previous papers at hand, we recall the necessary notation and statements together with the proofs of the essential ones.

2. Preliminaries 

By A we denote a finite set of symbols called letters; the set A is therefore called an alphabet. A finite string $w = w_0 w_1 \ldots w_{n-1}$ of letters from A is said to be a finite word; its length is denoted by $|w| = n$. Finite words over A together with the operation of concatenation and the empty word $\varepsilon$ as the neutral element form a free monoid $A^*$. The map

$$w = w_0 w_1 \ldots w_{n-1} \mapsto \overline{w} = w_{n-1} w_{n-2} \ldots w_0$$

is a bijection on $A^*$; the word $\overline{w}$ is called the reversal or the mirror image of w. A word w which coincides with its mirror image is a palindrome.

By an infinite word we understand an infinite string $u = u_0 u_1 u_2 \ldots$ of letters from A. A finite word w is a factor of a word v (finite or infinite) if there exist words p and s such that $v = pws$. If $p = \varepsilon$, then w is said to be a prefix of v; if $s = \varepsilon$, then w is a suffix of v.

The language $\mathcal{L}(v)$ of a finite or an infinite word v is the set of all its factors. Factors of v of length n form the set denoted by $\mathcal{L}_n(v)$. We say that the language of an infinite word u is closed under reversal if $\mathcal{L}(u)$ contains with every factor w also its reversal $\overline{w}$.
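The notions of factor, reversal and closure under reversal can be tried out on finite words. The following minimal Python sketch is only an illustration and not part of the original argument; the helper names `factors`, `reversal` and `is_closed_under_reversal` are hypothetical, and closure is tested only among the factors of a single finite word and a single length.

```python
def factors(v, n):
    """Return the set of factors of length n of the finite word v."""
    return {v[i:i + n] for i in range(len(v) - n + 1)}


def reversal(w):
    """Return the mirror image of the word w."""
    return w[::-1]


def is_closed_under_reversal(v, n):
    """Check whether every factor of length n of v has its reversal in v."""
    fs = factors(v, n)
    return all(reversal(w) in fs for w in fs)
```

For instance, the factors of length 2 of `abaab` are `ab`, `ba` and `aa`, a set which contains the reversal of each of its elements.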

For any factor $w \in \mathcal{L}(u)$, there exists an index i such that w is a prefix of the infinite word $u_i u_{i+1} u_{i+2} \ldots$. Such an index is called an occurrence of w in u. If each factor of u has infinitely many occurrences in u, the infinite word u is said to be recurrent. It is easy to see that if the language of u is closed under reversal, then u is recurrent (a proof can be found in [9]). For a recurrent infinite word u, we may define the notion of a complete return word of any $w \in \mathcal{L}(u)$. It is a factor $v \in \mathcal{L}(u)$ such that w is a prefix and a suffix of v and w occurs in v exactly twice.

If any factor $w \in \mathcal{L}(u)$ has only finitely many complete return words, then the infinite word u is called uniformly recurrent.

The factor complexity of an infinite word u is the map $C_u : \mathbb{N} \to \mathbb{N}$ defined by the prescription $C_u(n) := \#\mathcal{L}_n(u)$. To determine the first difference of the factor complexity, one has to count the possible extensions of factors of length n. A right extension of $w \in \mathcal{L}(u)$ is a letter $a \in A$ such that $wa \in \mathcal{L}(u)$. Of course, any factor of u has at least one right extension. A factor w is called right special if w has at least two right extensions. Similarly, one can define a left extension and a left special factor. We will deal mainly with recurrent infinite words u. In such a case, any factor of u has at least one left extension.
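The link between the first difference of the factor complexity and right special factors can be observed numerically. The sketch below is an illustration under stated assumptions: it uses a finite prefix of the Fibonacci word as a stand-in for an infinite word (an example chosen here, not taken from the text), so the counts are reliable only for lengths n that are small compared with the prefix. The function names are hypothetical.

```python
def fibonacci_prefix(length):
    """Prefix of the Fibonacci word (a, ab, aba, abaab, ...) of the given length."""
    prev, cur = "b", "a"
    while len(cur) < length:
        prev, cur = cur, cur + prev
    return cur[:length]


def factors(v, n):
    """Set of factors of length n of the finite word v."""
    return {v[i:i + n] for i in range(len(v) - n + 1)}


def right_special(v, n):
    """Factors of length n of v having at least two right extensions inside v."""
    longer = factors(v, n + 1)
    return {w for w in factors(v, n)
            if sum(1 for x in longer if x.startswith(w)) >= 2}
```

On a binary alphabet each right special factor contributes exactly one to the difference of the numbers of factors of lengths n + 1 and n, which is what the sketch lets one check for small n.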

In [8] it is shown that any finite word w contains at most $|w| + 1$ distinct palindromes (including the empty word). The defect $D(w)$ of a finite word w is the difference between the maximal number of palindromes $|w| + 1$ and the actual number of palindromes contained in w.
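The definition of the defect of a finite word translates directly into a brute-force computation. The following Python sketch is an illustration only; the names `palindromic_factors` and `defect` are hypothetical, and the quadratic enumeration of factors is chosen for clarity rather than efficiency.

```python
def palindromic_factors(w):
    """Set of all distinct palindromic factors of w, including the empty word."""
    return {w[i:j]
            for i in range(len(w) + 1)
            for j in range(i, len(w) + 1)
            if w[i:j] == w[i:j][::-1]}


def defect(w):
    """Defect of w: |w| + 1 minus the number of distinct palindromic factors."""
    return len(w) + 1 - len(palindromic_factors(w))
```

For example, `abaab` contains the six palindromes ε, a, b, aa, aba, baab and has defect 0, whereas `abca` contains only ε, a, b, c and has defect 1.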

In accordance with the terminology introduced in [8], a factor with a unique occurrence in another factor is called unioccurrent.

The following corollary gives an insight into the birth of defects. 

Corollary 3 ([8]). The defect $D(w)$ of a finite word w is equal to the number of prefixes $w'$ of w for which the longest palindromic suffix of $w'$ is not unioccurrent in $w'$. In other words, if b is a letter and w a finite word, then $D(wb) = D(w) + \delta$, where $\delta = 0$ if the longest palindromic suffix of wb occurs exactly once in wb and $\delta = 1$ otherwise.

Corollary 3 implies that $D(v) \ge D(w)$ whenever w is a factor of v. It enables us to give a reasonable definition of the defect of an infinite word (see [5]).
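Corollary 3 suggests computing the defect prefix by prefix. The sketch below is an illustrative implementation of that recurrence, not code from the paper; the names `lps`, `occurrences` and `defect_incremental` are hypothetical, and occurrences are counted with overlaps allowed.

```python
def lps(w):
    """Longest palindromic suffix of w (the empty word if w is empty)."""
    for i in range(len(w) + 1):
        if w[i:] == w[i:][::-1]:
            return w[i:]


def occurrences(x, w):
    """Number of (possibly overlapping) occurrences of x in w."""
    return sum(1 for i in range(len(w) - len(x) + 1) if w.startswith(x, i))


def defect_incremental(w):
    """Count the prefixes of w whose longest palindromic suffix is not unioccurrent."""
    d = 0
    for k in range(1, len(w) + 1):
        prefix = w[:k]
        if occurrences(lps(prefix), prefix) > 1:
            d += 1
    return d
```

On `abca` the only prefix contributing to the defect is `abca` itself, whose longest palindromic suffix `a` occurs twice.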

Definition 4. The defect of an infinite word u is the number (finite or infinite)

$$D(u) = \sup\{D(w) : w \text{ is a prefix of } u\}.$$

Let us point out two facts.

(1) If we consider all factors of a finite or an infinite word u, we obtain the same defect, i.e.,

$$D(u) = \sup\{D(w) : w \in \mathcal{L}(u)\}.$$

(2) Any infinite word with finite defect contains infinitely many palindromes.

Using Corollary 3 and Definition 4, we immediately obtain the following corollary.

Corollary 5. Let u be an infinite word with language closed under reversal. The following 
statements are equivalent. 

(1) The defect of u is finite. 

(2) There exists an integer H such that the longest palindromic suffix of any prefix w of length $|w| \ge H$ occurs in w exactly once.

For the longest palindromic suffix of a word w we will sometimes use the notation $\mathrm{lps}(w)$.

The number of palindromes of a fixed length occurring in an infinite word is measured by the so-called palindromic complexity $P_u$, the map which assigns to any non-negative integer n the number

$$P_u(n) := \#\{w \in \mathcal{L}_n(u) : w \text{ is a palindrome}\}.$$

Denote by

$$T_u(n) = C_u(n+1) - C_u(n) + 2 - P_u(n+1) - P_u(n).$$

The following proposition is proven in [1] for uniformly recurrent words; however, as also noted in [6], the uniform recurrence is not needed in the proof and the statement holds for any infinite word with language closed under reversal.

Proposition 6 ([1]). If u is an infinite word with language closed under reversal, then

(2) $T_u(n) \ge 0$ for all $n \in \mathbb{N}$.

Let u be an infinite word with language closed under reversal. Using the proof of Proposition 6, those $n \in \mathbb{N}$ for which $T_u(n) = 0$ can be characterized in the language of graphs. Before doing that we need to introduce some more notions.

An n-simple path e is a factor of u of length at least n + 1 such that the only special (right or left) factors of length n occurring in e are its prefix and suffix of length n. If w is the prefix of e of length n and v is the suffix of e of length n, we say that the n-simple path e starts in w and ends in v. We will denote by $G_n(u)$ an undirected graph whose set of vertices is formed by unordered pairs $(w, \overline{w})$ such that $w \in \mathcal{L}_n(u)$ is right or left special. We connect two vertices $(w, \overline{w})$ and $(v, \overline{v})$ by an unordered pair $(e, \overline{e})$ if e or $\overline{e}$ is an n-simple path starting in w or $\overline{w}$ and ending in v or $\overline{v}$. Note that the graph $G_n(u)$ may have multiple edges and loops.

Lemma 7. If u is an infinite word with language closed under reversal and $n \in \mathbb{N}$, then $T_u(n) = 0$ if and only if both of the following conditions are met.

(1) The graph obtained from $G_n(u)$ by removing loops is a tree.

(2) Any n-simple path forming a loop in the graph $G_n(u)$ is a palindrome.

Proof. It is a direct consequence of the proof of Theorem 1.2 in [1] (recalled in this paper as Proposition 6). □

3. Proof of Theorem 1

The aim of this section is to prove Theorem 1, i.e., to prove the Brlek-Reutenauer conjecture under the additional assumption that the defect $D(u)$ of an infinite word u and the sum $\sum_{n=0}^{\infty} T_u(n)$ are finite. As observed in [6], it is easy to prove the "finite analogy" of the conjecture, which deals only with finite words. We will also make use of this result.

Theorem 8 ([6]). For every finite word w we have

$$2D(w) = \sum_{n=0}^{|w|} T_w(n),$$

where $T_w(n) = C_w(n+1) - C_w(n) + 2 - P_w(n+1) - P_w(n)$ and the index w means that we consider only factors of w.
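The finite identity of Theorem 8 is easy to test by direct computation. The Python sketch below is an illustration, not part of the proof; the function names `C`, `P`, `T` and `defect` mirror the notation $C_w$, $P_w$, $T_w$ and $D(w)$ but are otherwise hypothetical helpers.

```python
def factors(w, n):
    """Set of factors of length n of the finite word w."""
    return {w[i:i + n] for i in range(len(w) - n + 1)}


def C(w, n):
    """Factor complexity of the finite word w at length n."""
    return len(factors(w, n))


def P(w, n):
    """Palindromic complexity of the finite word w at length n."""
    return sum(1 for x in factors(w, n) if x == x[::-1])


def T(w, n):
    """The quantity T_w(n) from Theorem 8."""
    return C(w, n + 1) - C(w, n) + 2 - P(w, n + 1) - P(w, n)


def defect(w):
    """Defect of w: |w| + 1 minus the number of palindromic factors."""
    return len(w) + 1 - sum(P(w, n) for n in range(len(w) + 1))
```

For `abca` the values $T_w(0), \ldots, T_w(4)$ are 0, -1, 1, 1, 1, which sum to 2 = 2D(w). Individual terms may thus be negative for finite words, in contrast to Proposition 6 for infinite words with language closed under reversal.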

It may seem that the Brlek-Reutenauer conjecture for an infinite word u can be obtained from Theorem 8 by a "limit transition". However, such a transition would be far from rigorous. The following lemmas enable us to avoid this pitfall.

Lemma 9. Let u be an infinite word with language closed under reversal and finite defect. If q is its prefix satisfying $D(u) = D(q)$, then for any integer H such that $H > |q|$ one has

$$C_u(H) - P_u(H) = 2\,\#\{x \in \mathcal{L}(u) : x \text{ is a palindrome shorter than } H \text{ which is not contained in } q\}.$$

Proof. Let us define a mapping $f : S \to T$, where

$$S = \{x \in \mathcal{L}(u) : x \notin \mathcal{L}(q),\ |x| < H,\ x = \overline{x}\}$$

and

$$T = \{\{w, \overline{w}\} : w \in \mathcal{L}_H(u),\ w \ne \overline{w}\}.$$

Let x be a palindrome from S and i be the first occurrence of x in u. Put $w = u_{i+|x|-H} \cdots u_{i+|x|-1}$. It means that w is a factor of u of length H and x is a suffix of w. Since $H > |x|$, the factor w is not a palindrome; otherwise it contradicts the fact that i is the first occurrence of the palindrome x. We put $f(x) = \{w, \overline{w}\}$.

To show that f is surjective, we consider $w \in \mathcal{L}_H(u)$ such that $w \ne \overline{w}$. Let p be the prefix of u which ends in the first occurrence of w or $\overline{w}$ in u. Since $|p| \ge H = |w| > |q|$, we have according to Corollary 3 that $D(q) = D(p)$ and consequently, $\mathrm{lps}(p)$ is unioccurrent in p, which implies that $\mathrm{lps}(p)$ is not a factor of q. Moreover, $\mathrm{lps}(p)$ is shorter than H; otherwise it contradicts the choice of the prefix p. We found $x = \mathrm{lps}(p) \in S$ such that $f(x) = \{w, \overline{w}\}$, i.e., f is surjective.

To show that f is injective, we consider two palindromes $y, z \in S$ and we denote $f(y) = \{w_y, \overline{w_y}\}$ and $f(z) = \{w_z, \overline{w_z}\}$. From the definition of $w_x$ we know that the palindrome x occurs as a factor of $w_x$ exactly once, namely as its suffix. It means that x equals $\mathrm{lps}(w_x)$. Let us suppose that $f(y) = f(z)$. We have to discuss two cases.

(1) Case $w_y = w_z$. It gives $\mathrm{lps}(w_y) = \mathrm{lps}(w_z)$ and thus $y = z$.

(2) Case $w_y = \overline{w_z}$. It implies that y is a prefix of $w_z$ and z is a prefix of $w_y$. The fact that y is a prefix of $w_z$ forces the first occurrence of $w_y$ to be strictly smaller than the first occurrence of $w_z$. Simultaneously, since z is a prefix of $w_y$, the first occurrence of $w_z$ is strictly smaller than the first occurrence of $w_y$, which is a contradiction.

Consequently, the assumption $f(y) = f(z)$ implies $z = y$ and the mapping f is injective as well.

Existence of the bijection f between the finite sets T and S means $\#T = \#S$. Since from the definition of T it follows that $C_u(H) - P_u(H) = 2\,\#T$, the equality stated in the lemma is proven.

□

Lemma 10. Let u be an infinite word with language closed under reversal and finite defect. If q is its prefix satisfying $D(u) = D(q)$, then for any prefix p of u such that $|p| > |q|$ the number

$$\#\{x \in \mathcal{L}(p) : x \text{ is a palindrome of length at most } |q| \text{ which is not contained in } q\} + \sum_{n=|q|+1}^{|p|} P_p(n)$$

equals $|p| - |q|$.

Proof. At first we will show the equality

(3) $|p| - |q| = \#\{x \in \mathcal{L}(p) \setminus \mathcal{L}(q) : x = \overline{x}\}$.

Let us denote by $u^{(i)}$ the prefix of u of length i. For any palindrome $x \in \mathcal{L}(p) \setminus \mathcal{L}(q)$ we find the minimal index i such that x occurs in $u^{(i)}$. Since $x \in \mathcal{L}(p) \setminus \mathcal{L}(q)$, we have $|q| < i \le |p|$. Thus we map any element of $\{x \in \mathcal{L}(p) \setminus \mathcal{L}(q) : x = \overline{x}\}$ to an index $i \in \{|q|+1, |q|+2, \ldots, |p|\}$.

Let us look at the details of this mapping. The minimality of i guarantees that x is unioccurrent in $u^{(i)}$. Palindromicity of x gives that $x = \mathrm{lps}(u^{(i)})$. It implies that no two different palindromes are mapped to the same index i, i.e., the mapping is injective.

Since $D(q) = D(u)$, according to Corollary 3, $\mathrm{lps}(u^{(i)})$ is unioccurrent in $u^{(i)}$ and thus $\mathrm{lps}(u^{(i)}) \notin \mathcal{L}(q)$. Thus any index i such that $|q| < i \le |p|$ has its preimage $x = \mathrm{lps}(u^{(i)})$. Therefore the mapping is a bijection and its domain and range have the same cardinality, as stated in (3).

To finish the proof, we split elements of $\{x \in \mathcal{L}(p) \setminus \mathcal{L}(q) : x = \overline{x}\}$ into two disjoint parts: elements of length smaller than or equal to $|q|$ and elements of length greater than $|q|$. Since

$$\#\{x \in \mathcal{L}(p) \setminus \mathcal{L}(q) : x = \overline{x},\ |x| > |q|\} = \#\{x \in \mathcal{L}(p) : x = \overline{x},\ |x| > |q|\} = \sum_{n=|q|+1}^{|p|} P_p(n),$$

the statement of Lemma 10 is proven.

□ 
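Equality (3) can be observed on concrete prefixes. The sketch below is an illustration under an assumption made here and not in the text: it takes prefixes of the Fibonacci word, all of which have defect 0, so that any two prefixes q and p satisfy D(q) = D(p). The helper names are hypothetical.

```python
def fibonacci_prefix(length):
    """Prefix of the Fibonacci word (a, ab, aba, abaab, ...) of the given length."""
    prev, cur = "b", "a"
    while len(cur) < length:
        prev, cur = cur, cur + prev
    return cur[:length]


def palindromes(w):
    """Set of all distinct palindromic factors of w, including the empty word."""
    return {w[i:j]
            for i in range(len(w) + 1)
            for j in range(i, len(w) + 1)
            if w[i:j] == w[i:j][::-1]}


def defect(w):
    """Defect of w: |w| + 1 minus the number of palindromic factors."""
    return len(w) + 1 - len(palindromes(w))


def new_palindromes(p, q_len):
    """Number of palindromic factors of p that do not occur in its prefix of length q_len."""
    return len(palindromes(p) - palindromes(p[:q_len]))
```

Whenever the two prefixes have equal defect, `new_palindromes(p, q_len)` equals `len(p) - q_len`, in agreement with (3).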

Now we can complete the proof of Theorem 1.

Proof of Theorem 1. Finiteness of defect means that there exists a constant $L \in \mathbb{N}$ such that $D(u) = D(q)$ for any prefix q of u which is longer than or of length equal to L. On the other hand, finiteness of the sum $\sum_{n=0}^{\infty} T_u(n)$ together with the fact $0 \le T_u(n) \in \mathbb{Z}$ for any $n \in \mathbb{N}$ implies that there exists a constant $M \in \mathbb{N}$ such that $T_u(n) = 0$ for any $n > M$. Let us fix an integer $H > \max\{L, M\}$ and denote by q the prefix of u of length $|q| = H - 1$. Consequently,

$T_u(n) = 0$ for any $n \ge H$ and $D(u) = D(q)$.

In order to show the equality (1), it thus remains to show $2D(q) = \sum_{n=0}^{H-1} T_u(n)$.

Let us consider a prefix p of u containing all factors of length H. In this case p is longer than q, thus it holds by Corollary 3 that $D(q) = D(p)$. Using Theorem 8 we have

$$2D(p) = \sum_{n=0}^{|p|} T_p(n) = \sum_{n=0}^{H-1} T_p(n) + \sum_{n=H}^{|p|} T_p(n) = \sum_{n=0}^{H-1} T_u(n) + \sum_{n=H}^{|p|} T_p(n),$$

where the last equality is due to the fact that p contains all factors of length H. It remains to prove that $\sum_{n=H}^{|p|} T_p(n) = 0$. Let us rewrite the sum by definition.

$$\sum_{n=H}^{|p|} T_p(n) = \sum_{n=H}^{|p|} \bigl(C_p(n+1) - C_p(n) + 2 - P_p(n+1) - P_p(n)\bigr)$$

(4) $$= -C_p(H) + 2(|p| - H + 1) - 2\sum_{n=H}^{|p|} P_p(n) + P_p(H)$$

$$= -C_u(H) + 2(|p| - H + 1) - 2\sum_{n=H}^{|p|} P_p(n) + P_u(H),$$

where in the last equality we again used the fact that p contains all factors of length H. This fact also allows us to rewrite the set $\{x \in \mathcal{L}(p) : x \notin \mathcal{L}(q),\ x = \overline{x},\ |x| \le |q|\}$ from Lemma 10 as $\{x \in \mathcal{L}(u) : x \notin \mathcal{L}(q),\ x = \overline{x},\ |x| < H\}$. Denote the cardinality of this set by B. In this notation, Lemma 9 and Lemma 10 say

$$C_u(H) - P_u(H) = 2B \quad\text{and}\quad B + \sum_{n=H}^{|p|} P_p(n) = |p| - H + 1.$$

This implies that the last expression in (4) is zero as desired.

□ 

4. Proof of Theorem 2

If an infinite word u is periodic with language closed under reversal, then $D(u) < +\infty$ and $\sum_{n=0}^{\infty} T_u(n) < +\infty$, as shown in [6]. Consequently, we will limit our considerations in the sequel to aperiodic words.

Proposition 11. If u is an aperiodic infinite word with language closed under reversal and N is an integer, then $T_u(n) = 0$ for all $n \ge N$ if and only if for any factor w such that $|w| \ge N$, any factor longer than N beginning in w or $\overline{w}$ and ending in w or $\overline{w}$, with no other occurrences of w or $\overline{w}$, is a palindrome.

Proof. $(\Leftarrow)$: Let us show for any $n \ge N$ that the assumptions of Lemma 7 are satisfied. We have to show two properties of $G_n(u)$ for any $n \ge N$.

(1) Any loop in $G_n(u)$ is a palindrome.

Since any loop e in $G_n(u)$ at a vertex $(w, \overline{w})$ is a word beginning in a special factor w or $\overline{w}$ and ending in w or $\overline{w}$, with no other occurrences of w or $\overline{w}$, the loop e is a palindrome by the assumption.

(2) The graph obtained from $G_n(u)$ by removing loops is a tree.

Or equivalently, we have to show that in $G_n(u)$ there exists a unique path between any two different vertices $(w', \overline{w'})$ and $(w'', \overline{w''})$. Let p be a factor of u such that $w'$ or $\overline{w'}$ is its prefix, $w''$ or $\overline{w''}$ is its suffix and p has no other occurrences of $w'$, $\overline{w'}$, $w''$, $\overline{w''}$. Let v be a factor starting in p, ending in $w'$ or $\overline{w'}$ and containing no other occurrences of $w'$ or $\overline{w'}$. By the assumption the factor v is a palindrome, thus $\overline{p}$ is a suffix of v. It is then a direct consequence of the construction of v that the next factor with the same properties as p, i.e., representing a path in the undirected graph $G_n(u)$ between $w'$ and $w''$, which occurs in u after p, is $\overline{p}$. This shows that there is only one such path.

Consequently, Lemma 7 implies that $T_u(n) = 0$ for any $n \ge N$.

$(\Rightarrow)$: First we prove an auxiliary claim.

Claim: If u is an aperiodic infinite word with language closed under reversal and N is an integer such that $T_u(n) = 0$ for all $n \ge N$, then for any w such that $|w| \ge N$ and any factor v longer than N beginning in w and ending in w or $\overline{w}$, with no other occurrences of w or $\overline{w}$, there exists a letter $a \in A$ such that v has prefix $wa$ and suffix $a\overline{w}$.

It is clear that repeated application of the previous claim to factors w of length gradually increased by one gives the proof of implication $(\Rightarrow)$ of Proposition 11.

We split the proof of the auxiliary claim into two cases. 

• Case 1: Assume that w is a special factor.

If v does not contain any other special factor of length $n = |w|$ except for w and $\overline{w}$, then v is a loop in the graph $G_n(u)$ and according to Lemma 7 the factor v is a palindrome. Necessarily, v begins in $wa$ for some letter a and ends in $a\overline{w}$.

Suppose now that $v = v_0 v_1 \cdots v_m$ contains a special factor $z \ne w, \overline{w}$ of length n at the position i, i.e., $z = v_i v_{i+1} \cdots v_{i+n-1}$. Without loss of generality, we consider the smallest index i with this property. The pair $(z, \overline{z})$ is a vertex in the graph $G_n(u)$ and a prefix of v, say e, corresponds to an edge in $G_n(u)$ starting in $(w, \overline{w})$ and ending in $(z, \overline{z})$. Since the graph $G_n(u)$ is a tree, the word v which corresponds to a walk from $(w, \overline{w})$ to the same vertex $(w, \overline{w})$ has a suffix f representing an edge in $G_n(u)$ connecting again vertices $(z, \overline{z})$ and $(w, \overline{w})$. It means that the suffix f starts in z or $\overline{z}$ and ends in w or $\overline{w}$. Since $G_n(u)$ has no multiple edges connecting distinct vertices, necessarily $f = \overline{e}$, which already gives the claim.

• Case 2: Assume w is not a special factor.

It means that there exists a unique letter a such that $wa$ belongs to the language of u. As the language is closed under reversal, the factor $\overline{w}$ has a unique left extension, namely a. If v starts in w and ends in $\overline{w}$, then the claim is proven.

It remains to exclude that v begins and ends in a non-palindromic factor w. Suppose this situation happens. In this case, there exists a unique q such that $wq$ is a right special factor and it is the shortest right special factor having the prefix w. The factor $wq$ has only one occurrence of the factor w; otherwise we can find a shorter prolongation of w which is right special. Since w is a suffix of v, we deduce that $|wq| < |v|$. Because $wq$ is the shortest right special factor with prefix w, the factor $vq$ belongs to the language and its prefix and suffix $wq$ is a special factor. According to the already proven Case 1, we have $wq = \overline{wq} = \overline{q}\,\overline{w}$. It means together with the inequality $|wq| < |v|$ that $\overline{w}$ is contained in v as well, which is a contradiction.

□ 

The proof of the implication $(\Rightarrow)$ of Proposition 11 is taken from [10], where we showed a more general statement for an infinite word whose language is closed under a larger group of symmetries.

Corollary 12. Let u be an aperiodic infinite word with language closed under reversal and let N be an integer. If $T_u(n) = 0$ for all $n \ge N$, then the occurrences of w and $\overline{w}$ in u alternate for any factor w of u of length at least N.
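The alternation described in Corollary 12 can be observed on an example. The sketch below is an illustration under an assumption made here and not in the text: it uses a prefix of the Fibonacci word, a Sturmian word for which $T_u(n) = 0$ for every n, and it examines only non-palindromic factors. The helper names are hypothetical.

```python
def fibonacci_prefix(length):
    """Prefix of the Fibonacci word (a, ab, aba, abaab, ...) of the given length."""
    prev, cur = "b", "a"
    while len(cur) < length:
        prev, cur = cur, cur + prev
    return cur[:length]


def positions(x, u):
    """Start indices of all (possibly overlapping) occurrences of x in u."""
    return [i for i in range(len(u) - len(x) + 1) if u.startswith(x, i)]


def occurrences_alternate(w, u):
    """True if occurrences of a non-palindromic w and of its reversal alternate in u."""
    labelled = sorted([(i, "w") for i in positions(w, u)] +
                      [(i, "r") for i in positions(w[::-1], u)])
    return all(a[1] != b[1] for a, b in zip(labelled, labelled[1:]))
```

In the Fibonacci word, for instance, `ab` occurs at positions 0, 3, 5, 8, ... and `ba` at positions 1, 4, 6, 9, ..., so the two interleave.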

The following lemma builds a bridge between Corollary 5 and Proposition 11.

Lemma 13. Let u be an aperiodic infinite word with language closed under reversal. There exists $H \in \mathbb{N}$ such that the longest palindromic suffix of any prefix w of u of length $|w| \ge H$ occurs in w exactly once if and only if there exists $N \in \mathbb{N}$ such that for any factor w with $|w| \ge N$, any factor longer than N beginning in w or $\overline{w}$ and ending in w or $\overline{w}$, with no other occurrences of w or $\overline{w}$, is a palindrome.

Proof. $(\Rightarrow)$: We will show that N may be set equal to H. Let us proceed by contradiction. Suppose there exists a factor $w \in \mathcal{L}(u)$ such that $|w| \ge H$ and there exists a non-palindromic factor of u beginning in w or $\overline{w}$ and ending in w or $\overline{w}$, with no other occurrences of w or $\overline{w}$. Let us find the first non-palindromic factor of the above form in u and let us denote it r. Let p be the prefix of u ending in the first occurrence of r in u, i.e., $p = tr$ for some word t and r is unioccurrent in p. Denote by s the longest palindromic suffix of p. By the assumption, s is unioccurrent in p. No matter how long the suffix s is, we will obtain a contradiction.

(1) If $|s| \le |w|$, then we have a contradiction to the unioccurrence of s.

(2) If $|r| > |s| > |w|$, then we can find at least 3 occurrences of w or $\overline{w}$ in r, which is a contradiction to the form of r.

(3) The equality $|r| = |s|$ contradicts the fact that we supposed r to be non-palindromic.

(4) Finally, if $|r| < |s|$, then there is an occurrence of the mirror image of r, which is a non-palindromic factor having the same properties as r, which occurs before r and contradicts the choice of p.

$(\Leftarrow)$: Take a prefix containing all factors of length N. Set H equal to its length. Let us show that any prefix p of length greater than or equal to H has $\mathrm{lps}(p)$ of length greater than or equal to N. Consider a suffix of p of length N, say w. Either w is a palindrome, and then $\mathrm{lps}(p)$ is of length greater than or equal to N. Or w is not a palindrome, and then we find a suffix of p beginning in $\overline{w}$ and containing exactly two occurrences of w or $\overline{w}$. Such a suffix exists since all factors of length N are contained in p. By assumptions, such a suffix is a palindrome, hence $\mathrm{lps}(p)$ is longer than N.

Any prefix p of u of length greater than or equal to H has $\mathrm{lps}(p)$ unioccurrent. Assume there are more occurrences of $\mathrm{lps}(p)$ in p and consider its suffix v starting in the last-but-one occurrence of $\mathrm{lps}(p)$. Since the length of $\mathrm{lps}(p)$ is greater than or equal to N, the factor v is a palindrome by assumptions, which contradicts the choice of $\mathrm{lps}(p)$. □

Proof of Theorem 2. For periodic words, the statement was shown in [6]. If u is aperiodic, then the statement is a direct consequence of Lemma 13, Corollary 5, and Proposition 11. □